NON-VANISHING OF DIRICHLET L-FUNCTIONS IN GALOIS ORBITS


RIZWANUR KHAN, DJORDJE MILICEVIC, AND HIEU T. NGO 


Abstract. A well known result of Iwaniec and Sarnak states that for at least one third of the
primitive Dirichlet characters to a large modulus $q$, the associated L-functions do not vanish at
the central point. When $q$ is a large power of a fixed prime, we prove the same proportion already
among the primitive characters of any given order. The set of primitive characters modulo $q$ of a
given order can be described as an orbit under the action of the Galois group of the corresponding
cyclotomic field. We also prove a positive proportion of nonvanishing within substantially shorter
orbits generated by intermediate Galois groups as soon as they are larger than roughly the
square-root of the prime-power conductor.


1. Introduction 

Central values of L-functions are of fundamental importance in number theory. In particular, a 
host of results and conjectures, including the Birch and Swinnerton-Dyer Conjecture, the Riemann 
Hypothesis, and the Katz-Sarnak Density Conjecture, predict in various contexts that the central 
values of L-functions (or their derivatives, as appropriate for root number reasons) hold key
arithmetic information and should vanish only when there are deep arithmetic reasons for them to do
so and that this should be an exceptional occurrence in suitably generic families. 

Introduced by Bohr and Landau [3] in their study of zeroes of the Riemann zeta-function and 
notably used by Selberg m in the course of proving that a positive proportion of these zeroes 
lie on the critical line, the “mollifier” is the most versatile tool used in analytic number theory 
to prove the non-vanishing of central values of L-functions in families, often achieving a positive 
proportion result. What is by now a classical result using the mollifier is one of Iwaniec and Sarnak 
[10], concerning non-vanishing in the family of Dirichlet L-functions. They proved that for at least
$(\frac{1}{3} - \epsilon)$ of the primitive Dirichlet characters $\chi$ modulo $q$, where $q$ is any integer
sufficiently large in terms of $\epsilon$, the central value $L(\frac{1}{2},\chi)$ is not zero. This is currently
the best known result that can be proved with a “one-piece” mollifier. Earlier, Balasubramanian and
Murty [1] had established a smaller positive proportion of non-vanishing, and recently, Bui [5] proved,
using a “two-piece” mollifier, that about 34% of the central values in this family do not vanish. When
one restricts to the quadratic Dirichlet L-functions, Soundararajan [21] established that for at least
$\frac{7}{8}$ of the fundamental discriminants $|d| < X$, the central value $L(\frac{1}{2},(\frac{d}{\cdot}))$ is not
zero, as $X \to \infty$. It is
generally conjectured (see the discussion in |3T]) that 

(1.1)  $L(\frac{1}{2},\chi) \neq 0$

for any primitive character $\chi$. Such a statement appears to be substantially beyond the reach of
currently available technology. 

Let $\xi$ be a primitive $\phi(q)$-th root of unity, where $\phi$ is the Euler totient function. The Galois
group $G = \mathrm{Gal}(\mathbb{Q}(\xi)/\mathbb{Q})$ acts on the set of primitive Dirichlet characters modulo $q$ as follows.
For $\sigma \in G$, we define $\chi^{\sigma}$ to be the character given by $\chi^{\sigma}(n) = \sigma(\chi(n))$ for all integers $n$. The
Galois action partitions the set of characters into orbits, which are particularly natural from the
arithmetic point of view, since the associated Dirichlet L-functions (by definition) share the same 
field of coefficients. One is led to wonder whether a positive proportion of non-vanishing can be 
proven for Dirichlet L-functions within each orbit. The aforementioned results [10] and [5] do not
preclude the possibility that $L(\frac{1}{2},\chi) = 0$ for all $\chi$ in some orbit $\mathcal{O}$.

As a positive proportion statement toward (1.1) in this context, we conjecture that for some
$c > 0$, we have that

(1.2)  $\frac{1}{|\mathcal{O}|} \sum_{\chi \in \mathcal{O},\ L(\frac{1}{2},\chi) \neq 0} 1 > c - \epsilon$

for any $\epsilon > 0$, any orbit $\mathcal{O}$ of cardinality $|\mathcal{O}| > q^{\epsilon}$, and any integer $q$ sufficiently large in terms of $\epsilon$.
One might hope to match the best constant $c = 0.34$ currently available for the full set of primitive
Dirichlet L-functions or the more classical proportion of Iwaniec and Sarnak. We establish the
latter in the case that $q$ is a large power of a prime.

Theorem 1.1. Let $q = p^k$ for an odd prime $p$. For any $\epsilon > 0$ and $k$ large enough in terms of $\epsilon$
and $p$, we have that (1.2) holds with $c = \frac{1}{3}$.

When $q = p^k$ the orbits under the Galois action can be described as follows. First recall that a
necessary and sufficient condition for the existence of a primitive character modulo $p^k$ having order
$l$ is that $l = p^{k-1}d$ for some $d \mid (p-1)$. The set of all primitive Dirichlet characters modulo $q$ whose
orders equal $p^{k-1}d$ forms an orbit $\mathcal{O}$ of cardinality

(1.3)  $|\mathcal{O}| = \phi(p^{k-1}d)$

and every orbit arises in this way. Thus O depends on d but we suppress this in the notation. For 
these facts, see, for example, [71 Chapter 5] and [51 page 16]. 
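
The description of the orbits by order can be checked numerically for small moduli. The following is a minimal sketch, not taken from the paper: it assumes $k \ge 2$, indexes the characters of the cyclic group $(\mathbb{Z}/p^k\mathbb{Z})^{\times}$ by exponents $j$ modulo $N = p^{k-1}(p-1)$, and uses the fact that such a character is primitive exactly when $p \nmid j$. The names `euler_phi` and `orbit_sizes` are illustrative.

```python
from math import gcd
from collections import Counter


def euler_phi(n):
    """Euler totient function by trial division (fine for small n)."""
    result, m, f = n, n, 2
    while f * f <= m:
        if m % f == 0:
            while m % f == 0:
                m //= f
            result -= result // f
        f += 1
    if m > 1:
        result -= result // m
    return result


def orbit_sizes(p, k):
    """Count primitive characters mod p^k (k >= 2) by their order.

    Characters are chi_j(g) = exp(2*pi*i*j/N) for a fixed generator g,
    with j mod N and N = p^(k-1) * (p-1); chi_j is primitive iff p does
    not divide j, and its order is N / gcd(j, N).
    """
    N = p ** (k - 1) * (p - 1)
    counts = Counter()
    for j in range(N):
        if j % p == 0:
            continue  # imprimitive: factors through a smaller modulus
        counts[N // gcd(j, N)] += 1
    return counts


if __name__ == "__main__":
    for order, size in sorted(orbit_sizes(5, 3).items()):
        # each order has the form p^(k-1) * d with d | (p-1)
        print(order, size, euler_phi(order))
```

For $p = 5$, $k = 3$ the orders that occur are $25d$ with $d \in \{1, 2, 4\}$, and each class has $\phi(25d)$ elements, in line with the count of characters of a given order.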

Although, from an analytic perspective, it may appear that we are dealing with a family only 
slightly thinner than the original unitary family of all primitive Dirichlet characters, in reality this 
gives rise to a significant difficulty, which we describe below in the introduction. The key feature 
here is that the family is “thinning out” in a thoroughly arithmetic (rather than analytic) way. In 
fact, the same device that we use to overcome this basic difficulty subsequently allows us to prove 
a positive proportion of non-vanishing in substantially smaller “thin orbits” in Theorem 1.3 below.

The problem of studying the non-vanishing of L-functions within Galois orbits is a natural one 
that has yielded some of the strongest known results in the subject. Let $q = p^k$ for the remainder of
this paper. Let $f$ be a holomorphic newform of weight 2 and level coprime to $p$ which has rational
Fourier coefficients (equivalently, $f$ is associated to an elliptic curve over $\mathbb{Q}$ of conductor coprime
to $p$, by [551 Theorems 7.14, 7.15] and [231 [HI II]). Let $L(s, f \times \chi)$ be the L-function of $f$ twisted
by a primitive Dirichlet character $\chi$ modulo $q$, having central point $s = 1$ by a functional equation
normalized so as to relate $L(s, f \times \chi)$ and $L(2 - s, f \times \bar{\chi})$. Rohrlich [16] showed that $L(1, f \times \chi)$ does
not vanish as long as $k$ is large enough in terms of $p$ and $f$. To prove his result, Rohrlich appealed
to an “algebraicity” theorem of Shimura [nnn Theorem 1], which implies that if $L(1, f \times \chi) = 0$
then $L(1, f \times \chi^{\sigma}) = 0$ for all $\sigma \in G$. By this, if the sum

(1.4)  $\sum_{\chi \in \mathcal{O}} L(1, f \times \chi)$

is nonzero, then every summand is nonzero. Rohrlich found an asymptotic for (1.4) for any orbit
$\mathcal{O}$ when $k$ is large enough in terms of $p$ and $f$, and showed that the main term is indeed nonzero.
Chinta [6] extended Rohrlich’s work to the case of prime moduli $q$ by considering instead the sum

(1.5)  $\sum_{\chi \in \mathcal{O}} L(1, f \times \chi) M(f \times \chi)$,

where $M(f \times \chi)$ is a truncation of the formal Dirichlet series for $L(1, f \times \chi)^{-1}$. The effect of this
mollifier is that each summand of (1.5) is “morally” close to 1, and this allows Chinta to show that
the sum is nonzero provided $|\mathcal{O}| >$ and $q$ is large enough in terms of $f$. In particular, this
implies the non-vanishing of $L(1, f \times \chi)$ over big orbits when $q$ is a large enough prime in terms of
$f$, a fact which does not follow from Rohrlich’s result.

All these results for twists of elliptic modular L-functions rely heavily on the algebraicity results 
of Shimura. Such a route is not available in the present context of central values of Dirichlet 
L-functions. 

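We establish Theorem 1.1 by evaluating the mollified moments

(1.6)  $\sum_{\chi \in \mathcal{O}} L(\frac{1}{2},\chi) M(\chi)$

and

(1.7)  $\sum_{\chi \in \mathcal{O}} |L(\frac{1}{2},\chi)|^2 |M(\chi)|^2$

where

(1.8)  $M(\chi) = \sum_{m \le q^{\theta}} \frac{a_m \chi(m)}{m^{1/2}}$
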
is a mollifier of length $q^{\theta}$, for some $\theta > 0$ and coefficients $a_m$ satisfying $a_m \ll m^{\epsilon}$ and $a_1 = 1$. We
are able to obtain asymptotics with a power-saving error term for arbitrary mollifiers when $\theta$ is any
fixed constant satisfying $0 < \theta < \frac{1}{2}$; see Propositions 2.4 and 2.5. As discussed in Section 2.3, this
allows us to deduce Theorem 1.1 with the proportion of non-vanishing $c = \frac{1}{3}$ by taking $\theta \to \frac{1}{2}$.

Although $|L(\frac{1}{2},\chi)|^2$ can be considered to be analogous to $L(1, f \times \chi)$, our problem has some
important differences from the one considered by Rohrlich and Chinta. Firstly, the additional
factor $|M(\chi)|^2$ in (1.7) makes our problem more complex. This is readily seen when comparing
with (1.4), and to compare with (1.5) we note that Chinta’s method only works for a specific
mollifier while ours works for an arbitrary mollifier, and that M{f x x) has length at most q^~^ 
(see [6l pg 22]) while |M(x)p has length at most q^~^. However one must keep in mind that Chinta 
proves a result which works for q prime, while ours does not. 

The second difference is the method of proof. To evaluate (1.4), Rohrlich had to consider the
averages

(1.9)  $\sum_{\chi \in \mathcal{O}} \chi(n)$

for $n <$ while for (1.7) we must consider the averages

(1.10)  $\sum_{\chi \in \mathcal{O}} \chi(n_1 m_1) \bar{\chi}(n_2 m_2)$

in which $n_1 n_2 < q^{1+\epsilon}$ and $m_1, m_2 < q^{\theta}$. As we will see, (1.9) is zero unless $n^{p-1} \equiv 1 \bmod p^{k-1}$.
From this, Rohrlich could immediately conclude that $n = 1$ or $n > p^{\frac{k-1}{p-1}}$, thereby effectively isolating
the contribution of the diagonal term $n = 1$. In contrast, (1.10) is zero unless

(1.11)  $(n_1 m_1)^{p-1} \equiv (n_2 m_2)^{p-1} \bmod p^{k-1}$.

Writing $n_1 m_1 \equiv \zeta n_2 m_2 \bmod p^{k-1}$, it is now much harder to isolate the contribution of the diagonal
terms $n_1 m_1 = n_2 m_2$ (that is, $\zeta \equiv 1 \bmod p^{k-1}$); a priori it is, for example, perfectly plausible that
$n_1 m_1$ and $n_2 m_2$ could be fairly close to each other without actually being equal. To isolate the
contribution of the diagonal terms $n_1 m_1 = n_2 m_2$, we will appeal to the p-adic version of Roth’s
theorem, Lemma 2.6 below. The upshot is that, keeping in mind that $\zeta^{p-1} \equiv 1 \bmod p^{k-1}$, having two
solutions to (1.11) within the same class of $\zeta \not\equiv \pm 1 \bmod p^{k-1}$ too close to each other would ultimately
yield too good of an approximation in the p-adic norm to a $(p-1)$-st p-adic root of unity.

It is interesting that Roth’s theorem, a deep result from diophantine approximation, should be 
used to prove the non-vanishing of L-functions. This connection has been made before in different 
contexts by Rohrlich [15] and Greenberg [5]. Our paper offers another such example and it seems
to be the first one involving a family of Dirichlet characters as well as the first one involving a 
genuinely second mollified moment. 


Taking $\theta = 0$ in our evaluation of (1.6) and (1.7) yields in particular the first and second moments
of $L(\frac{1}{2},\chi)$ over Galois orbits. Before stating this result, we note that the value $\chi(-1)$ is the same
for every character in any given orbit $\mathcal{O}$, as it is a rational number.


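Theorem 1.2. Let $q = p^k$ for an odd prime $p$ and let $\mathcal{O}$ be any Galois orbit of primitive Dirichlet
characters mod $q$. Suppose that $\chi(-1) = (-1)^{\iota}$ for any $\chi \in \mathcal{O}$. We have that

$\frac{1}{|\mathcal{O}|} \sum_{\chi \in \mathcal{O}} L(\frac{1}{2},\chi) = 1 + O(q^{-\frac{1}{4}+\epsilon})$,

$\frac{1}{|\mathcal{O}|} \sum_{\chi \in \mathcal{O}} |L(\frac{1}{2},\chi)|^2 = \frac{p-1}{p} \Big( \log\frac{q}{\pi} + \frac{\Gamma'(\frac{1}{4}+\frac{\iota}{2})}{\Gamma(\frac{1}{4}+\frac{\iota}{2})} + 2\gamma + \frac{2\log p}{p-1} \Big) + O(q^{-\frac{1}{4}+\epsilon})$,

for any $\epsilon > 0$, where the implied constants depend on $\epsilon$ and $p$, and $\gamma = 0.57721\ldots$ is the Euler
constant.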


We remark that the implicit constants in all our main results are ineffective due to their dependence
on p-adic Roth’s Theorem. However, the first moment in Theorem 1.2 and the mollified first
moment in Proposition 2.4 can also be evaluated without recourse to p-adic Roth’s Theorem, at
the expense of the error terms $O(q^{-\frac{1}{4}+\epsilon})$ being replaced by the weaker but effective error terms
$O(q^{-\frac{1}{2(p-1)}+\epsilon})$. This will be shown in the course of the proof of Proposition 2.4.


Finally, we address the refined question of non-vanishing in smaller sub-families within the Galois
orbits of primitive characters modulo $q$. A rather natural sub-family emerges when considering
orbits of primitive characters under various subgroups $H$ of the Galois group $G = \mathrm{Gal}(\mathbb{Q}(\xi)/\mathbb{Q})$.
These subgroups form a partially ordered set (corresponding by Galois theory to the tree of
intermediary field extensions $\mathbb{Q} \subseteq K \subseteq \mathbb{Q}(\xi)$), and, as the subgroup $H$ varies from $G$ through its various
subgroups to the identity, the corresponding orbits of a fixed primitive character modulo $q$ can be
seen as interpolating (or shrinking) between its full Galois orbit, considered in Theorem 1.1, and
the individual character.

We describe these “thin orbits” explicitly in cases of our interest. For every $0 \le \kappa \le k-1$, denote
$K_{\kappa} = \mathbb{Q}(\xi^{p^{k-1-\kappa}})$. (Note that the field $K_0$ is independent of $k$.) Since $[K_{k-1} : K_0] \asymp_p \phi(\phi(q))$
and we are primarily concerned with the case of fixed $p$ and large $k$, we focus here on the
tower of these intermediate fields

$\mathbb{Q}(\xi) = K_{k-1} \supseteq \cdots \supseteq K_0 \supseteq \mathbb{Q}$.

The Galois group $\mathrm{Gal}(\mathbb{Q}(\xi)/\mathbb{Q})$ acts transitively on any given Galois orbit $\mathcal{O}$. The intermediate
Galois group $\mathrm{Gal}(\mathbb{Q}(\xi)/K_{k-1-\kappa})$ therefore acts on $\mathcal{O}$; we call an orbit of this action a thin Galois
orbit, and we write $\mathcal{O}_{\kappa}$ for any one of these thin orbits. Note that already the thin orbits $\mathcal{O}_{k-1}$
refine the full Galois orbits $\mathcal{O}$, with thin orbits $\mathcal{O}_{\kappa}$ for smaller $\kappa$ being progressively smaller (so
that we may think of the parameter $\kappa$ essentially as an indicator of the logarithmic size of the
corresponding thin orbits), all the way to the extreme case of $\kappa = 0$, which corresponds to the
single primitive characters.

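It is not difficult to see that $\sigma \in \mathrm{Gal}(\mathbb{Q}(\xi)/\mathbb{Q})$ satisfies $\sigma \in \mathrm{Gal}(\mathbb{Q}(\xi)/K_{k-1-\kappa})$ if and only if

(1.12)  $\sigma(\xi) = \xi^{a}$ for some $a \equiv 1 \bmod p^{k-1-\kappa}(p-1)$.

We thus see that, equivalently, two characters $\chi_1, \chi_2$ are in the same thin Galois orbit $\mathcal{O}_{\kappa}$ if and
only if $\chi_1 \bar{\chi}_2$ has order dividing $p^{\kappa}$ (informally speaking, if $\chi_1$ and $\chi_2$ differ by an “algebraically
simpler” character, one that is “shallower” in the sense of the “depth aspect” of modulus $q = p^k$
with large $k$). In particular, all thin orbits $\mathcal{O}_{\kappa}$ for the same $\kappa$ are of equal size given by (keeping in
mind that $(a, \phi(q)) = 1$)

$|\mathcal{O}_{\kappa}| = \begin{cases} p^{\kappa}, & 0 \le \kappa < k-1, \\ p^{k-2}(p-1), & \kappa = k-1. \end{cases}$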
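
The thin orbits can be enumerated directly for small $p^k$. The sketch below is an illustration under stated assumptions rather than the authors' construction: characters are indexed by exponents $j$ modulo $N = p^{k-1}(p-1)$, a Galois element $\sigma$ with $\sigma(\xi) = \xi^{a}$ acts by $j \mapsto aj \bmod N$, and the subgroup fixing $K_{k-1-\kappa}$ is taken to be the set of $a \equiv 1 \bmod p^{k-1-\kappa}(p-1)$ coprime to $N$. The function name `thin_orbit_sizes` is hypothetical.

```python
from math import gcd


def thin_orbit_sizes(p, k, kappa):
    """Sizes of all thin orbits O_kappa of primitive characters mod p^k (k >= 2)."""
    N = p ** (k - 1) * (p - 1)
    step = p ** (k - 1 - kappa) * (p - 1)
    # exponents a = 1 mod step that define Galois automorphisms (gcd(a, N) = 1)
    group = [a for a in range(1, N + 1, step) if gcd(a, N) == 1]
    primitive = [j for j in range(N) if j % p != 0]
    seen, sizes = set(), []
    for j in primitive:
        if j in seen:
            continue
        orbit = {a * j % N for a in group}  # orbit of chi_j under the subgroup
        seen |= orbit
        sizes.append(len(orbit))
    return sizes


if __name__ == "__main__":
    for kappa in range(3):
        print(kappa, sorted(set(thin_orbit_sizes(5, 3, kappa))))
```

For $p = 5$, $k = 3$ this gives orbits of size $1$, $5$ and $20$ for $\kappa = 0, 1, 2$, and in each case the orbits partition the $80$ primitive characters.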


Another, more “analytic”, way to think about the thin orbits $\mathcal{O}_{\kappa}$ is provided by the explicit
characterization of the “principal part” of the dual of the group $(\mathbb{Z}/q\mathbb{Z})^{\times}$ for a high prime power
$q = p^k$, which is essentially due to Postnikov [13]. Let $X_k = (\mathbb{Z}/p^k\mathbb{Z})^{\times}$, and let $\pi_a$ denote the power
map $\pi_a : X_k \to X_k$, $[x] \mapsto [x^a]$. Corresponding to the decomposition $X_k = X_{k0} \times X_{k1}$, where $X_{k0} =
\mathrm{im}\,\pi_{p^{k-1}} = \ker \pi_{p-1}$, $|X_{k0}| = p - 1$, and $X_{k1} = \mathrm{im}\,\pi_{p-1} = \ker \pi_{p^{k-1}} = \{[x] \in X_k : x \equiv 1 \bmod p\}$,
we have the canonical decomposition of dual groups $\widehat{X}_k = \widehat{X}_{k0} \times \widehat{X}_{k1}$. Let $\log_p$ denote the p-adic
logarithm and ipix) denote the “standard” additive character i/; : Qp —>■ such that its kernel is 

exactly Zp and that for x € Z[l/p] C Qp n K. According to Postnikov’s lemma (for 

p > 2; see also [12] Lemma 13]), every character x^^^ G ^ki is of the form 


X^^^ (l+pt) = xi^^ (1 + pt) = -0 


- ^( 1 ) 


oo log (1 +pt) 


for some a = aop~^ G p“^Zp/p“^Zp, with primitive characters corresponding to a G p“^Zp lp~^'Lp. 

The isomorphism of p~^'Lp jp~^'Lp —> Xki given by a i-A Xa^^ induces a metric on Xki via 
d(Xa^\xl^^) = |n ~ ^p|P = oond(xa^^xi^^) for xi^^ ^ xl^^- Relative to the above decomposition of 
Xk and this metric on Xki, the thin orbit Ok containing a character x = is, for ft > 0 , 

precisely the set {x^°^} x B[x^^\p'^]', here B[x^^\p'^] denotes the closed ball in Xki with center x^^^ 
and radius p” with respect to the above-defined metric. In particular, all characters in a thin orbit 
$\mathcal{O}_{\kappa}$ share the same $\widehat{X}_{k0}$-component and their $\widehat{X}_{k1}$-components are all close to each other, with the
corresponding neighborhood around a fixed character $\chi$ shrinking as $\kappa$ decreases.

From the point of view of harmonic analysis, we see clearly the basic difficulty of isolating
individual $\chi^{(0)} \in \widehat{X}_{k0}$ in our orbits (which, in a modified form, is already present in isolating the
full orbits $\mathcal{O}$), which on the dual side is reflected by the initial survival of the $(p-1)$-st roots of unity
in Lemma 3.1, followed by isolating characters in smaller neighborhoods within $\widehat{X}_{k1}$, which
corresponds to the survival of further terms in more permissive congruence classes containing these
roots of unity.

Our techniques, which ultimately rely on the impossibility of overly good p-adic approximations
to algebraic integers, are very well suited to the study of thin orbits of primitive characters to prime
power moduli and give the following refinement of Theorem 1.1.

Theorem 1.3. Let $q = p^k$ for an odd prime $p$. For any $\epsilon > 0$ and $k$ large enough in terms of $\epsilon$
and $p$, and for any $\kappa > k/2$, we have that (1.2) holds also when $\mathcal{O}$ is replaced by any “thin orbit”
$\mathcal{O}_{\kappa}$, with

$c = \frac{\kappa/k - 1/2}{\kappa/k + 1/2}$.

To keep the article light and readable, we present our arguments in the context of Theorem 1.1
first and then indicate the adjustments needed for the proof of Theorem 1.3 in Section 3.3.


2. Preliminaries 

Notation. Throughout the paper, $\epsilon > 0$ denotes a parameter which may be chosen to be as small
as we like, but need not have the same value from one occurrence to another. The letter $p$ denotes
an odd prime and $q = p^k$. We use $\mu_{p-1}$ to denote the set of $(p-1)$-st roots of unity in the p-adic
integers $\mathbb{Z}_p$. All implicit constants may depend on $\epsilon$, $p$ and the parameter $\theta$ introduced in (1.8),
but not on $k$.
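
For experimentation, the elements of $\mu_{p-1}$ can be computed modulo $p^k$. The sketch below is not part of the paper; it relies on the standard fact that $a^{p^{k-1}} \bmod p^k$ is the reduction of the Teichmüller lift of $a$, and the helper names are illustrative. The second function records how small an integer representative of a root $\zeta \neq \pm 1$ can be in absolute value, which is the quantity that the Roth-type input controls for large $k$.

```python
def roots_of_unity_mod(p, k):
    """Residues mod p^k of the (p-1)-st roots of unity in Z_p, ordered by residue mod p."""
    q = p ** k
    return [pow(a, p ** (k - 1), q) for a in range(1, p)]


def smallest_lift_sizes(p, k):
    """Least absolute integer representative mod p^k of each root other than +1 and -1."""
    q = p ** k
    return sorted(min(z, q - z) for z in roots_of_unity_mod(p, k) if z not in (1, q - 1))


if __name__ == "__main__":
    print(roots_of_unity_mod(5, 2))   # residues mod 25
    print(smallest_lift_sizes(5, 2))
```

This is only an illustration for small $k$: it says nothing about the (ineffective) threshold for $k$ in the lemmas that follow.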


2.1. Approximate functional equations. We have the following standard approximate functional
equations.


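Lemma 2.1. For a primitive Dirichlet character $\chi$ modulo $q$, let $\iota$ be defined by $\chi(-1) = (-1)^{\iota}$
and let

(2.1)  $U(x) = \frac{1}{2\pi i} \int_{(2)} \frac{\Gamma(\frac{s}{2}+\frac{1}{4}+\frac{\iota}{2})}{\Gamma(\frac{1}{4}+\frac{\iota}{2})} (\pi^{1/2} x)^{-s} \frac{ds}{s}$,
       $V(x) = \frac{1}{2\pi i} \int_{(2)} \frac{\Gamma(\frac{s}{2}+\frac{1}{4}+\frac{\iota}{2})^2}{\Gamma(\frac{1}{4}+\frac{\iota}{2})^2} (\pi x)^{-s} \frac{ds}{s}$.

We have that

(2.2)  $U(x) \ll_c x^{-c}$,  $V(x) \ll_c x^{-c}$

for any $x, c > 0$. For any $\Delta > 0$, we have that

(2.3)  $L(\frac{1}{2},\chi) = \sum_{n \ge 1} \frac{\chi(n)}{n^{1/2}} U\Big(\frac{n}{q^{1+\Delta}}\Big) + O(q^{-99})$,

(2.4)  $|L(\frac{1}{2},\chi)|^2 = 2 \sum_{n_1, n_2 \ge 1} \frac{\chi(n_1)\bar{\chi}(n_2)}{(n_1 n_2)^{1/2}} V\Big(\frac{n_1 n_2}{q}\Big)$,

where the implied constant in (2.3) depends on $\Delta$.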

Proof. The first equation (2.3) is established by the functional equation of $L(s,\chi)$, which may
be found in [9, Theorem 4.15], together with [9, Theorem 5.3], in which we take $G(u) = 1$ and
$X = q^{\frac{1}{2}+\Delta}$. With this choice of $X$, the second sum in [9, (5.12)] may be bounded by $q^{-99}$. The
second equation is established by applying [9, Theorem 5.3] to the product $L(s,\chi)L(s,\bar{\chi})$, rather
than to each factor individually, with $G(u) = 1$ and $X = 1$. The estimates (2.2) may be found in
[9, Proposition 5.4]. □

The sums in (2.3) and (2.4) are essentially restricted to $n < q^{1+\Delta+\epsilon}$ and $n_1 n_2 < q^{1+\epsilon}$ by (2.2).


2.2. Character averages. In this section, we record the orthogonality relations provided by
averaging over the family of Dirichlet characters in a Galois orbit; see Lemma 2.3 below. We start
with a familiar auxiliary result.

Lemma 2.2. Let $m$ and $k \ge 1$ be integers. If $p^k \mid (m^p - 1)$, then $p^{k-1} \mid (m - 1)$.


Proof. The claim is trivially true for $m = 1$, so assume $m \neq 1$. Since $m \equiv m^p \equiv 1 \bmod p$, we may
write $m = 1 + p^r t$ for some $r \ge 1$ and some integer $t$ with $p \nmid t$. Then we have

$m^p - 1 = (1 + p^r t)^p - 1 \equiv p^{r+1} t \bmod p^{r+2}$,

so that $p^{r+1}$ is the highest power of $p$ that divides $m^p - 1$. In particular, since $p^k \mid (m^p - 1)$, we
get $r + 1 \ge k$, and so $r \ge k - 1$ and $p^{k-1} \mid (m - 1)$.

□ 
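
Lemma 2.2 is easy to test by brute force, since both divisibility conditions depend only on $m$ modulo $p^k$. The following sketch (with the hypothetical helper name `lemma_2_2_counterexamples`) lists residues that violate the implication; it should return an empty list for odd $p$, while $p = 2$ shows that the oddness assumption in the Notation cannot be dropped.

```python
def lemma_2_2_counterexamples(p, k):
    """Residues m mod p^k with p^k | m^p - 1 but p^(k-1) not dividing m - 1."""
    q = p ** k
    bad = []
    for m in range(1, q):
        if pow(m, p, q) == 1 and (m - 1) % p ** (k - 1) != 0:
            bad.append(m)
    return bad


if __name__ == "__main__":
    print(lemma_2_2_counterexamples(3, 4))  # odd prime: no counterexamples
    print(lemma_2_2_counterexamples(2, 3))  # p = 2: the implication fails
```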


Lemma 2.3. Let $q = p^k$ for an odd prime $p$ and let $\mathcal{O}$ be any Galois orbit of primitive Dirichlet
characters mod $q$. For any integer $n$, we have that

(2.5)  $\sum_{\chi \in \mathcal{O}} \chi(n) = 0$

unless

(2.6)  $n^{p-1} \equiv 1 \bmod p^{k-1}$.


Proof. Suppose that $\mathcal{O}$ has cardinality given by (1.3), for some $d \mid (p-1)$. We first show that (2.5)
holds unless

(2.7)  $n^{p(p-1)} \equiv 1 \bmod q$.


It was shown in 0 pg 17] that 

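(2.8)  $\frac{1}{|\mathcal{O}|} \sum_{\chi \in \mathcal{O}} \chi(n) = \frac{\mu(\mathrm{ord}(n^{(p-1)/d}))}{\phi(\mathrm{ord}(n^{(p-1)/d}))}$

where $\mathrm{ord}(n^{(p-1)/d})$ denotes the multiplicative order of $n^{(p-1)/d}$ in the group $(\mathbb{Z}/q\mathbb{Z})^{\times}$. Since $|(\mathbb{Z}/q\mathbb{Z})^{\times}| =
p^{k-1}(p-1)$, if (2.7) is not satisfied, then $p^2$ divides $\mathrm{ord}(n^{(p-1)/d})$, and so (2.8) is zero.

Now if (2.7) holds, then Lemma 2.2 implies, by taking $m = n^{p-1}$, that (2.6) holds. □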


Note that the condition (2.6), which is all we will need from our orthogonality relations, requires
less information than what is provided by the full sum over all $\chi \in \mathcal{O}$ or the explicit evaluation
(2.8). This will be transparent in Lemma 3.1, which features a thinner average and for which we
provide an independent proof.
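
The orthogonality relation of Lemma 2.3 can be verified numerically for small moduli. The sketch below is illustrative only and assumes $k \ge 2$: it fixes a primitive root $g$ modulo $p^k$, realizes the characters of order $p^{k-1}d$ as $\chi_e(g^r) = \exp(2\pi i\, e r / N)$ with $N/\gcd(e, N) = p^{k-1}d$, and sums them in floating point. The function names are hypothetical.

```python
import cmath
from math import gcd


def multiplicative_order(a, q):
    """Order of a in (Z/qZ)^*, assuming gcd(a, q) = 1."""
    x, t = a % q, 1
    while x != 1:
        x = x * a % q
        t += 1
    return t


def orbit_sums(p, k, d):
    """Map each unit n mod p^k to the sum of chi(n) over characters of order p^(k-1)*d."""
    q = p ** k
    N = p ** (k - 1) * (p - 1)
    g = next(a for a in range(2, q) if a % p and multiplicative_order(a, q) == N)
    order = p ** (k - 1) * d
    exponents = [e for e in range(N) if N // gcd(e, N) == order]
    sums = {}
    for r in range(N):
        n = pow(g, r, q)
        sums[n] = sum(cmath.exp(2j * cmath.pi * e * r / N) for e in exponents)
    return sums


if __name__ == "__main__":
    p, k, d = 3, 4, 2
    survivors = [n for n, s in orbit_sums(p, k, d).items() if abs(s) > 1e-8]
    # every surviving n should satisfy n^(p-1) = 1 mod p^(k-1)
    print(all(pow(n, p - 1, p ** (k - 1)) == 1 for n in survivors))
```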
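2.3. Mollifiers. The starting point of the mollifier method is the observation that

(2.9)  $\frac{1}{|\mathcal{O}|} \sum_{\chi \in \mathcal{O},\ L(\frac{1}{2},\chi) \neq 0} 1 \ \ge\ \frac{\Big| \frac{1}{|\mathcal{O}|} \sum_{\chi \in \mathcal{O}} L(\frac{1}{2},\chi) M(\chi) \Big|^2}{\frac{1}{|\mathcal{O}|} \sum_{\chi \in \mathcal{O}} |L(\frac{1}{2},\chi) M(\chi)|^2}$.

The inequality above follows from the Cauchy-Schwarz inequality. The next step of the method is to
evaluate the mollified moments. We prove the following.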
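
The role of the mollifier in the Cauchy-Schwarz step can be seen on toy data. The sketch below is not from the paper: `values` stands in for the central values over a family and `mollifier` for the corresponding weights $M(\chi)$, both made up for illustration. The returned ratio is a lower bound for the proportion of nonzero entries, and it is sharp when the mollified values are constant on the support.

```python
def nonvanishing_lower_bound(values, mollifier):
    """Cauchy-Schwarz lower bound |mean(v*m)|^2 / mean(|v*m|^2) for the share of nonzero v."""
    n = len(values)
    first = sum(v * m for v, m in zip(values, mollifier)) / n
    second = sum(abs(v * m) ** 2 for v, m in zip(values, mollifier)) / n
    return abs(first) ** 2 / second


if __name__ == "__main__":
    values = [0, 2, 0, 0.5]            # toy "central values": half of them vanish
    perfect = [1, 0.5, 1, 2]           # weights inverting the nonzero values
    print(nonvanishing_lower_bound(values, [1, 1, 1, 1]))  # unmollified: weaker bound
    print(nonvanishing_lower_bound(values, perfect))       # mollified: the true proportion
```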


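Proposition 2.4. Let $q = p^k$ for an odd prime $p$ and let $\mathcal{O}$ be any Galois orbit of primitive Dirichlet
characters mod $q$. For $0 < \theta < 1$ in (1.8), we have that

$\frac{1}{|\mathcal{O}|} \sum_{\chi \in \mathcal{O}} L(\frac{1}{2},\chi) M(\chi) = 1 + O(q^{-\frac{1}{4}+\epsilon} + q^{-\frac{1}{2}+\frac{\theta}{2}+\epsilon})$.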


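Proposition 2.5. Let $q = p^k$ for an odd prime $p$ and let $\mathcal{O}$ be any Galois orbit of primitive Dirichlet
characters mod $q$. Suppose that $\chi(-1) = (-1)^{\iota}$ for any $\chi \in \mathcal{O}$. For $0 < \theta < \frac{1}{2}$ in (1.8), we have
that

$\frac{1}{|\mathcal{O}|} \sum_{\chi \in \mathcal{O}} |L(\frac{1}{2},\chi)|^2 |M(\chi)|^2 = \frac{p-1}{p} \sum_{m_1, m_2 \le q^{\theta},\ (m_1 m_2, q) = 1} \frac{a_{m_1} \overline{a_{m_2}}}{[m_1, m_2]} \Big( \log\Big( \frac{q (m_1, m_2)^2}{\pi m_1 m_2} \Big) + C \Big) + O(q^{-\frac{1}{4}+\epsilon} + q^{-\frac{1}{2}+\theta+\epsilon})$,

where $(m_1, m_2)$ denotes the greatest common divisor of $m_1$ and $m_2$, $[m_1, m_2] = \frac{m_1 m_2}{(m_1, m_2)}$ denotes the
least common multiple of $m_1$ and $m_2$, and $C = \frac{\Gamma'(\frac{1}{4}+\frac{\iota}{2})}{\Gamma(\frac{1}{4}+\frac{\iota}{2})} + 2\gamma + \frac{2\log p}{p-1}$.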


The final step is to insert the main terms of Propositions 2.4 and 2.5 into the ratio on the right
hand side of (2.9), and then choose the coefficients $a_m$ so that the ratio is maximized. We note that
the main terms of our mollified moments are identical to those in Iwaniec and Sarnak’s problem
(see [10, (5.5)]) because in both problems the main terms arise from the diagonal contributions.
That is, if $\chi(-1) = (-1)^{\iota}$ for all $\chi \in \mathcal{O}$, we have for any $0 < \theta < \frac{1}{2}$ and some $\delta > 0$ depending on $p$
and $\theta$ that

E" L{\,x)M{x)^q-\ 

' ' xeO r yi) ^ ^ 

x(-i)=(-i)‘ 

' ' XeO ^ X mod q 

where $\sum^{*}$ means that summation is restricted to the primitive characters and $\phi^{*}(q) = \sum^{*}_{\chi \bmod q} 1$.
The optimal choice for $a_m$ will therefore be the same as in Iwaniec and Sarnak’s problem (and we
will not repeat the proof of this here), which yields that the ratio on the right hand side of (2.9) is
$\frac{\theta}{1+\theta} + o(1)$ as $k \to \infty$. Taking $\theta \to \frac{1}{2}$, Theorem 1.1 thus follows from Propositions 2.4 and 2.5.


2.4. Roth’s theorem. The following lemma is a special case of a result of Rohrlich [15, Proposition
1, pg 401]. It is a consequence of the p-adic version of Roth’s theorem, due to Ridout [14].

Lemma 2.6. Let $\beta \in \mathbb{Z}_p$ and $0 < \delta < \frac{1}{2}$. Suppose that $\beta$ is algebraic over $\mathbb{Q}$ of degree at least 2.
Then, for sufficiently large $k \ge k_0(\beta, \delta)$, there are no nonzero integers $a$ and $b$ which satisfy

$|a - b\beta|_p \le p^{-k+1}$

and

$|a|, |b| \le (p^k)^{\frac{1}{2}-\delta}$.

Proof. In [15, Proposition 1, pg 401], take $\alpha_p = 1$ and $\beta_p = \beta$. Note that condition (ii) directly
above the proposition is satisfied because $\beta \notin \mathbb{Q}$. Note that $k_0$ is not computable. □


When $(b, p) = 1$, this result says that an approximation of the algebraic number $\beta$ by $\frac{a}{b}$ to an error
within $(\max\{|a|, |b|\})^{-2-\delta}$ is too good to exist. Note that there are infinitely many approximations
of any $\beta \in \mathbb{Z}_p$ up to within $\ll (\max\{|a|, |b|\})^{-2}$ by a p-adic incarnation of Dirichlet’s Approximation
Theorem (see also [5] for an analogue of the Farey dissection in this context), and it is a hallmark of
Roth’s theorem that the exponent in Lemma 2.6 is essentially the best possible. For an interesting
investigation into existence of algebraic numbers exhibiting approximability by rationals within the 
transition range between the exponents —2 and —2 — 5, see m- 

We stress that the p-adic analogues of previous partial results toward Roth’s Theorem, such as 
the theorems of Liouville, Thue, Siegel, and Dyson, do not (except for a few small primes p) suffice 
to obtain an asymptotic in the situation of Proposition 2.5 or Theorem 1.1 with our methods. This
is so because, for example when estimating (3.12), in order to obtain (3.13) with a o(1)-upper bound
on the right-hand side for large q, we need to be able to take Q (essentially the allowable length of 
intervals in Lemma [2771 below) to be at least (p^')*“''^, whereas all results prior to the actual p-adic 
Roth’s theorem furnish only (p'=)°p(i). 

We will also use the following convenient implication of Lemma 2.6.

Lemma 2.7. Let $0 < \delta < \frac{1}{2}$, let $k \ge k_1(\delta)$ be sufficiently large, and let $A_k$ and $B_k$ be intervals in
the rational integers of length at most $(p^k)^{\frac{1}{2}-\delta}$. Then there are at most $p - 3$ pairs $(a, b) \in A_k \times B_k$
such that

$(ab, p) = 1$,

(2.10)  $a \not\equiv \pm b \bmod p^{k-1}$

and

(2.11)  $a^{p-1} - b^{p-1} \equiv 0 \bmod p^{k-1}$.

Proof. Suppose that $a$ and $b$ satisfy the conditions of the lemma. The congruence (2.11) implies
that the p-adic integer $a^{p-1} - b^{p-1}$ has norm

(2.12)  $|a^{p-1} - b^{p-1}|_p = \Big| \prod_{\zeta \in \mu_{p-1}} (a - b\zeta) \Big|_p \le p^{-k+1}$.

Recall that the roots of unity in $\mu_{p-1}$ are distinct modulo $p$. By this fact and the assumption
$(ab, p) = 1$, we have that $a - b\zeta$ and $a - b\zeta'$ are distinct modulo $p$ for $\zeta \neq \zeta'$. Thus, by (2.12) and
(2.10), for some $\zeta \in \mu_{p-1} \setminus \{\pm 1\}$ we have

$|a - b\zeta|_p \le p^{-k+1}$.

Now suppose for a contradiction that at least $p - 2$ pairs $(a, b)$ satisfy the conditions of the lemma.
Then by the argument above and Dirichlet’s Box Principle, we have for some $\zeta \in \mu_{p-1} \setminus \{\pm 1\}$ at
least two distinct pairs $(a_1, b_1), (a_2, b_2) \in A_k \times B_k$ satisfying

$|a_j - b_j \zeta|_p \le p^{-k+1}$  $(j = 1, 2)$.

It follows from the strong triangle inequality that

$|(a_2 - a_1) - (b_2 - b_1)\zeta|_p \le \max\{|a_1 - b_1\zeta|_p, |a_2 - b_2\zeta|_p\} \le p^{-k+1}$.

We also have, by the assumption on the lengths of $A_k$ and $B_k$, that

$|a_2 - a_1|, |b_2 - b_1| \le (p^k)^{\frac{1}{2}-\delta}$.

Thus by Lemma 2.6, in which we take $\beta = \zeta$, we deduce that for $k \ge k_1$ sufficiently large, where $k_1$
is not computable, we must have $a_2 - a_1 = b_2 - b_1 = 0$. This is the desired contradiction. □
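
The pairs counted in Lemma 2.7 can be enumerated by brute force for small parameters. The sketch below is an illustration only (the lemma itself needs $k$ larger than an ineffective threshold): `count_pairs` is a hypothetical helper that counts pairs $(a, b)$ in two given integer ranges satisfying $(ab, p) = 1$, (2.10) and (2.11). Over a full set of residues modulo $p^{k-1}$ each unit $b$ admits exactly $p - 3$ admissible classes $a \equiv b\zeta$, which gives a convenient sanity check.

```python
def count_pairs(p, k, a_range, b_range):
    """Count (a, b) with (ab, p) = 1, a != +-b mod p^(k-1), a^(p-1) = b^(p-1) mod p^(k-1)."""
    mod = p ** (k - 1)
    count = 0
    for a in a_range:
        for b in b_range:
            if a % p == 0 or b % p == 0:
                continue  # condition (ab, p) = 1
            if (a - b) % mod == 0 or (a + b) % mod == 0:
                continue  # condition (2.10)
            if pow(a, p - 1, mod) == pow(b, p - 1, mod):
                count += 1  # condition (2.11)
    return count


if __name__ == "__main__":
    # full residue system mod 25: 20 units b, each with p - 3 = 2 partners
    print(count_pairs(5, 3, range(1, 25), range(1, 25)))
    # short intervals of length 4: only (3, 4) and (4, 3), since 3^4 = 4^4 = 6 mod 25
    print(count_pairs(5, 3, range(1, 5), range(1, 5)))
```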


3. Proofs of the main results 


3.1. Proof of Proposition 2.4. In Lemma 2.1, let $\Delta > 0$ be such that $\Delta + \theta < 1$. We have that

(3.1)  $\frac{1}{|\mathcal{O}|} \sum_{\chi \in \mathcal{O}} L(\frac{1}{2},\chi) M(\chi) = \sum_{m \le q^{\theta},\ n \ge 1} \frac{a_m}{(nm)^{1/2}} U\Big(\frac{n}{q^{1+\Delta}}\Big) \frac{1}{|\mathcal{O}|} \sum_{\chi \in \mathcal{O}} \chi(nm) + O(q^{-99})$.

Note that the exchange of summation above is valid because although the function $U(x)$ depends
on $\chi(-1)$, this value is the same for every character in $\mathcal{O}$. Now, by Lemma 2.3, we have that

$\frac{1}{|\mathcal{O}|} \sum_{\chi \in \mathcal{O}} \chi(nm) = 0$

unless $(nm)^{p-1} \equiv 1 \bmod p^{k-1}$, in which case we have by Hensel’s Lemma that $nm$ is congruent
to one of $\zeta_1, \ldots, \zeta_{p-1} \in \mu_{p-1}$ modulo $p^{k-1}$. Separating the term $nm = 1$ for the main term in
Proposition 2.4 and using also the divisor bound, we have that (3.1) equals

(“) S E »-’)+o(<,- 1 : 1 : 

l<s<9 i<3<P-l 

8=0 mod p'““^ s= 0 modp'““^ 

By shifting the line of integration in (2.1) to $\Re(s) = -\frac{1}{2} + \epsilon$, we see that the main term above equals

$1 + O(q^{-\frac{1}{2}+\epsilon})$.

The summands in the first error term are particularly sensitive to the size of $s$. From the
condition that $s^{p-1} \equiv 1 \bmod p^{k-1}$ and $s > 1$, we immediately have that $s > p^{\frac{k-1}{p-1}}$, and the first
error term is seen to be $O(q^{-\frac{1}{2(p-1)}+\epsilon})$
without recourse to p-adic Roth’s theorem. However, we can improve this estimate by appealing to
Lemma 2.6. The terms with $s \equiv \pm 1 \bmod p^{k-1}$, $s > 1$, contribute $O(q^{-\frac{1}{2}+\epsilon})$. As for the terms
corresponding to $s \equiv \zeta_j \not\equiv \pm 1 \bmod p^{k-1}$, Lemma 2.6 with $\beta = \zeta_j$ guarantees that, for sufficiently
large $k$, there are no values of $|s| \le (p^k)^{\frac{1}{2}-\delta}$ with $|s - \zeta_j \cdot 1|_p \le p^{-k+1}$, so that all these terms must
in fact satisfy $s > (p^k)^{\frac{1}{2}-\delta}$, and in total the first error term is $O(q^{-\frac{1}{4}+\frac{\delta}{2}+\epsilon})$.
Finally, writing $s = \zeta_j + qr$, we have that the second error term above is bounded by

$q^{\epsilon} \sum_{1 \le r < q^{\Delta+\theta+\epsilon}} (qr)^{-\frac{1}{2}} \ll q^{-\frac{1}{2}+\frac{\Delta+\theta}{2}+\epsilon}$.

Taking the positive $\Delta$ and $\delta$ to be as small as we like and adjusting the implied constants completes
the proof. □


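3.2. Proof of Proposition 2.5. By Lemma 2.1, we have that

(3.3)  $\frac{1}{|\mathcal{O}|} \sum_{\chi \in \mathcal{O}} |L(\frac{1}{2},\chi)|^2 |M(\chi)|^2 = 2 \sum_{m_1, m_2 \le q^{\theta}} \sum_{n_1, n_2 \ge 1} \frac{a_{m_1} \overline{a_{m_2}}}{(n_1 n_2 m_1 m_2)^{1/2}} V\Big(\frac{n_1 n_2}{q}\Big) \frac{1}{|\mathcal{O}|} \sum_{\chi \in \mathcal{O}} \chi(n_1 m_1) \bar{\chi}(n_2 m_2)$.

We write this as a sum of diagonal terms and off-diagonal terms,

$\sum_{n_1 m_1 = n_2 m_2} + \sum_{n_1 m_1 \neq n_2 m_2}$.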

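The diagonal. We first consider the diagonal terms. The equality $n_1 m_1 = n_2 m_2$ is the same
as requiring $n_1 = r m_2/(m_1, m_2)$ and $n_2 = r m_1/(m_1, m_2)$ for some $r \in \mathbb{N}$. Thus the diagonal
contribution is

$\sum_{n_1 m_1 = n_2 m_2} = 2 \sum_{m_1, m_2 \le q^{\theta},\ (m_1 m_2, p) = 1} \frac{a_{m_1} \overline{a_{m_2}}}{[m_1, m_2]} \sum_{r \ge 1,\ (r, p) = 1} \frac{1}{r} V\Big( \frac{r^2 m_1 m_2}{q (m_1, m_2)^2} \Big)$.

The innermost sum above can be evaluated by the calculation in [10, Lemma 4.1] and equals

$\frac{p-1}{2p} \Big( \log\Big( \frac{q (m_1, m_2)^2}{\pi m_1 m_2} \Big) + C \Big) + O\Big( \Big( \frac{q (m_1, m_2)^2}{m_1 m_2} \Big)^{-\frac{1}{2}+\epsilon} \Big)$,

where $C$ is as in the statement of Proposition 2.5. The main term above gives the main term of
Proposition 2.5, and the total error is less than

$q^{\epsilon} \sum_{m_1, m_2 \le q^{\theta}} \frac{1}{[m_1, m_2]} \Big( \frac{q (m_1, m_2)^2}{m_1 m_2} \Big)^{-\frac{1}{2}+\epsilon} \ll q^{-\frac{1}{2}+\epsilon} \sum_{m_1, m_2 \le q^{\theta}} \frac{1}{(m_1 m_2)^{1/2}} \ll q^{-\frac{1}{2}+\theta+\epsilon}$.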

The off-diagonal. We now turn to the off-diagonal contribution, which we must bound by a
negative power of $q$. By Lemma 2.3 and (2.2), we have that


E « 

nimi7^n2m2 


E E 

mi,m 2 <<j^, nin 2 <<?^'*'® 
(nin2mim2,p) = l 
(nimi)^“^=(n2m2)^~^ inodp^”^ 
nimi^n 2 m 2 


1 

(77i772777i7772)^ 
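Writing $a = n_1 m_1$ and $b = n_2 m_2$ and splitting the resulting sum above into dyadic intervals
$A \le a < 2A$ and $B \le b < 2B$, it suffices to show that the sum

(3.4)  $\frac{q^{\epsilon}}{(AB)^{1/2}} \sum_{A \le a < 2A,\ B \le b < 2B,\ (ab, p) = 1,\ a^{p-1} \equiv b^{p-1} \bmod p^{k-1},\ a \neq b} 1$

is less than a negative power of $q$ for

(3.5)  $1 \le AB \le q^{1+2\theta+\epsilon}$.

We split (3.4) further as

(3.6)  $\frac{q^{\epsilon}}{(AB)^{1/2}} \sum_{A \le a < 2A,\ B \le b < 2B,\ (ab, p) = 1,\ a \equiv \pm b \bmod p^{k-1},\ a \neq b} 1 \ +\ \frac{q^{\epsilon}}{(AB)^{1/2}} \sum_{A \le a < 2A,\ B \le b < 2B,\ (ab, p) = 1,\ a^{p-1} \equiv b^{p-1} \bmod p^{k-1},\ a \not\equiv \pm b \bmod p^{k-1}} 1$.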


In the first sum above, we must have $2A > \frac{1}{2} p^{k-1}$ or $2B > \frac{1}{2} p^{k-1}$. In the first case, for each of the
$B$ choices for $b$, there are $O(\frac{A}{q})$ choices for $a$ by the Chinese Remainder Theorem. In the same way
in the second case there are $O(\frac{AB}{q})$ choices for $a$ and $b$. Thus the first sum in (3.6) is bounded by

$\frac{q^{\epsilon}}{(AB)^{1/2}} \cdot \frac{AB}{q} \ll q^{-\frac{1}{2}+\theta+\epsilon}$.

Now we consider the second sum in (3.6). We must show that

(3.7)  $\frac{q^{\epsilon}}{(AB)^{1/2}} \sum_{A \le a < 2A,\ B \le b < 2B,\ (ab, p) = 1,\ a^{p-1} \equiv b^{p-1} \bmod p^{k-1},\ a \not\equiv \pm b \bmod p^{k-1}} 1$

is bounded by a negative power of $q$. We may assume that

(3.8)  $q^{-\frac{1}{2}} < \frac{A}{B} < q^{\frac{1}{2}}$,

since otherwise the proof is complete. To see this, suppose without loss of generality that $A < B$.
Then for each of the $A$ choices of $a$ in (3.7), there are $O(1 + \frac{B}{q})$ possibilities of $b$ which satisfy the
congruence $a^{p-1} \equiv b^{p-1} \bmod p^{k-1}$. Thus if (3.8) is not satisfied, then (3.7) is bounded by

(3.9)  $\frac{q^{\epsilon}}{(AB)^{1/2}} \Big( A + \frac{AB}{q} \Big) \ll q^{-\frac{1}{4}+\epsilon} + q^{-\frac{1}{2}+\theta+\epsilon}$.


Now, assuming (3.8), we analyze (3.7) according to the sizes of $A$ and $B$ as follows. Let $0 < \delta < \frac{1}{4}$
and $Q = q^{\frac{1}{2}-2\delta}$.

Case 1. Suppose that $2A \le q^{\frac{1}{2}-\delta}$ and $2B \le q^{\frac{1}{2}-\delta}$. Every term in (3.7) would have to satisfy
$a \equiv b\zeta_j \bmod p^{k-1}$ for some $\zeta_j \in \mu_{p-1} \setminus \{\pm 1\}$, and hence $|a - b\zeta_j|_p \le p^{-k+1}$ with $|a|, |b| \le (p^k)^{\frac{1}{2}-\delta}$.
For sufficiently large $k$, however, by Lemma 2.6 there are no such nonzero integers $a$ and $b$, and the
corresponding sum (3.7) is actually empty.
Case 2. Suppose that $2A > q^{\frac{1}{2}-\delta}$ and $2B \le q^{\frac{1}{2}-\delta}$. (The case $2A \le q^{\frac{1}{2}-\delta}$ and $2B > q^{\frac{1}{2}-\delta}$ is treated
similarly.) Dividing the dyadic interval $A \le a < 2A$ into smaller pieces of length $Q$, we may bound
(3.7) by

(3.10)  $\frac{q^{\epsilon}}{(AB)^{1/2}} \sum_{1 \le u \le \lceil A/Q \rceil}\ \sum_{A+(u-1)Q \le a < A+uQ,\ B \le b < 2B,\ (ab, p) = 1,\ a^{p-1} \equiv b^{p-1} \bmod p^{k-1},\ a \not\equiv \pm b \bmod p^{k-1}} 1$.

By Lemma 2.7, the innermost sum of (3.10) is, for sufficiently large $k$, bounded by a constant
(depending on $p$). Using this fact and (3.8), we see that (3.10) is bounded by

(3.11)  $\frac{q^{\epsilon}}{(AB)^{1/2}} \cdot \frac{A}{Q} \ll q^{-\frac{1}{4}+2\delta+\epsilon}$.

This falls into the error term of Proposition 2.5, as we may take $\delta$ to be as small as we like.


Case 3. Suppose that $2A > q^{\frac{1}{2}-\delta}$ and $2B > q^{\frac{1}{2}-\delta}$. Dividing the dyadic intervals $A \le a < 2A$ and
$B \le b < 2B$ into smaller pieces of length $Q$, we may rewrite (3.7) as

(3.12)  $\frac{q^{\epsilon}}{(AB)^{1/2}} \sum_{1 \le u \le \lceil A/Q \rceil,\ 1 \le v \le \lceil B/Q \rceil}\ \sum_{A+(u-1)Q \le a < A+uQ,\ B+(v-1)Q \le b < B+vQ,\ (ab, p) = 1,\ a^{p-1} \equiv b^{p-1} \bmod p^{k-1},\ a \not\equiv \pm b \bmod p^{k-1}} 1$.

By Lemma 2.7, the innermost sum of (3.12) is, for sufficiently large $k$, bounded by a constant
(depending on $p$). Using this fact and (3.5), we see that (3.12) is bounded by

(3.13)  $\frac{q^{\epsilon}}{(AB)^{1/2}} \cdot \frac{AB}{Q^2} \ll q^{-\frac{1}{2}+\theta+4\delta+\epsilon}$.

This falls into the error term of Proposition 2.5, as we may take $\delta$ to be as small as we like.
The proof of Proposition 2.5 is now complete.


□ 


3.3. The case of thin orbits. In this section, we prove Theorem 1.3. As is customary in analytic
number theory (and as is already the case with full Galois orbits of primitive characters), the
principal change introduced by the shrinking family of characters in the orbit is that more
terms survive averaging over the family. We quantify this effect with the following modification of
Lemma 2.3 on orthogonality relations.

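Lemma 3.1. Let $q = p^k$ for an odd prime $p$, let $0 \le \kappa \le k-1$, and let $\mathcal{O}_\kappa$ be a thin Galois orbit
of primitive Dirichlet characters mod $q$. For any integer $n$, we have that

\[
\sum_{\chi \in \mathcal{O}_\kappa} \chi(n) = 0 \tag{3.14}
\]

unless

\[
n^{p-1} \equiv 1 \bmod p^{\tilde\kappa+1}, \tag{3.15}
\]

where $\tilde\kappa = \min(\kappa, k-2)$.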
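Proof. Fix a character $\chi_0 \in \mathcal{O}_\kappa$. Fix a generator $g$ of the cyclic group $(\mathbb{Z}/p^k\mathbb{Z})^\times$ and write
$\chi_0(g) = e(\gamma/\varphi(q))$ for some $\gamma$. In particular, $(\gamma, \varphi(q)) = (p-1)/d$, where $d \mid (p-1)$ and $dp^{k-1}$ is the order of $\chi_0$ and
of all characters in $\mathcal{O}_\kappa$, so that the corresponding full orbit $\mathcal{O} \ni \chi_0$ has cardinality $\varphi(dp^{k-1})$.

If $n$ is an integer divisible by $p$, the lemma is trivially true. We therefore assume $n$ is relatively
prime to $p$ and let $0 \le r < p^{k-1}(p-1)$ be such that $n = g^r$ in $(\mathbb{Z}/p^k\mathbb{Z})^\times$. From (1.12), it is
immediate that

\[
\sum_{\chi \in \mathcal{O}_\kappa} \chi(n) = \sum_{\substack{a \bmod p^{k-1}(p-1) \\ (a,p)=1 \\ a \equiv 1 \bmod p^{k-1-\kappa}(p-1)}} e\Big(\frac{\gamma r a}{\varphi(q)}\Big) =
\begin{cases}
\chi_0(n) \displaystyle\sum_{0 < j \le p^{\kappa}} e(\gamma r j / p^{\kappa}), & 0 \le \kappa < k-1, \\[2ex]
\chi_0(n)^{p^{k-1}} \displaystyle\sum_{\substack{0 < j < p^{k-1} \\ p \nmid j}} e(\gamma r j / p^{k-1}), & \kappa = k-1.
\end{cases}
\]

In either case, the resulting sum vanishes unless $p^{\tilde\kappa} \mid r$, where $\tilde\kappa = \min(\kappa, k-2)$ as in the statement of the lemma. The condition $p^{\tilde\kappa} \mid r$ is equivalent to
$n^{p^{k-1-\tilde\kappa}(p-1)} \equiv 1 \bmod p^k$, which by Lemma 2.2 implies that $n^{p-1} \equiv 1 \bmod p^{\tilde\kappa+1}$. This proves the
lemma. □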
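
The vanishing statement of Lemma 3.1 can be tested numerically for small $p$ and $k$. The Python sketch below is an illustration under explicit assumptions rather than the construction of the paper: it takes the thin orbit to consist of the powers $\chi_0^a$ with $a$ modulo $\varphi(q)$, $(a,p)=1$ and $a \equiv 1 \bmod p^{k-1-\kappa}(p-1)$, and it takes $\chi_0(g) = e(\gamma/\varphi(q))$ for a brute-forced generator $g$. The function names `find_generator`, `thin_orbit_sum` and `check_lemma` are hypothetical.

```python
import cmath

def find_generator(p, k):
    """Brute-force a generator g of (Z/p^k Z)^* for an odd prime p."""
    q = p ** k
    phi = q - q // p
    for g in range(2, q):
        if g % p == 0:
            continue
        x, order = g, 1
        while x != 1:  # compute the multiplicative order of g mod q
            x = x * g % q
            order += 1
        if order == phi:
            return g
    raise ValueError("no generator found")

def thin_orbit_sum(p, k, kappa, gamma, r):
    """Sum of chi_0^a(g^r) = e(gamma*r*a/phi(q)) over the ASSUMED orbit:
    a mod phi(q), (a, p) = 1, a = 1 mod p^(k-1-kappa) * (p-1)."""
    phi = p ** (k - 1) * (p - 1)
    step = p ** (k - 1 - kappa) * (p - 1)
    total = 0
    for a in range(phi):
        if a % step == 1 % step and a % p != 0:
            total += cmath.exp(2j * cmath.pi * ((gamma * r * a) % phi) / phi)
    return total

def check_lemma(p, k, gamma):
    """True if every numerically nonzero orbit sum satisfies
    n^(p-1) = 1 mod p^(kt+1), where kt = min(kappa, k-2)."""
    q = p ** k
    phi = p ** (k - 1) * (p - 1)
    g = find_generator(p, k)
    for kappa in range(k):
        kt = min(kappa, k - 2)
        for r in range(phi):
            n = pow(g, r, q)
            if abs(thin_orbit_sum(p, k, kappa, gamma, r)) > 1e-6:
                if pow(n, p - 1, p ** (kt + 1)) != 1:
                    return False
    return True

print(check_lemma(3, 4, 1), check_lemma(5, 3, 2))
```

Here `gamma` must be coprime to $p$ with $(\gamma, \varphi(q))$ dividing $p-1$, so that $\chi_0$ is primitive; the two sample calls use $\gamma = 1$ and $\gamma = 2$. The check only tests the direction stated in the lemma (a nonzero sum forces the congruence).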


We will only use (3.15) as a condition modulo $p^{\kappa}$. In particular, note that the localization (2.6) is
achieved already by averaging over $\chi \in \mathcal{O}_{k-1}$. The resulting Ramanujan sum in the case $\kappa = k-1$
should be compared with the explicit evaluation (2.8).

The analogs of Propositions 2.4 and 2.5 on mollified moments are as follows:

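Proposition 3.2. Let $q = p^k$ for an odd prime $p$, let $k/2 < \kappa \le k-1$, and let $\mathcal{O}_\kappa$ be a thin Galois
orbit of primitive Dirichlet characters mod $q$. For $0 < \theta < 2\big(\frac{\kappa}{k} - \frac14\big)$ in (1.8), we have

\[
\frac{1}{|\mathcal{O}_\kappa|} \sum_{\chi \in \mathcal{O}_\kappa} L(\tfrac12, \chi) M(\chi) = 1 + O\big(q^{-\frac{\kappa}{4k}+\varepsilon} + q^{\frac14+\frac{\theta}{2}-\frac{\kappa}{k}+\varepsilon}\big),
\]

while, for $0 < \theta < \frac{\kappa}{k} - \frac12$ in (1.8), we have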


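\[
\frac{1}{|\mathcal{O}_\kappa|} \sum_{\chi \in \mathcal{O}_\kappa} |L(\tfrac12, \chi) M(\chi)|^2 = \frac{p-1}{p} \sum_{\substack{m_1, m_2 \le q^{\theta} \\ (m_1 m_2, q) = 1}} \frac{x_{m_1} x_{m_2}}{[m_1, m_2]} \Big( \log\Big( \frac{q (m_1, m_2)^2}{\pi m_1 m_2} \Big) + c \Big) + O\big(q^{-\frac{\kappa}{4k}+\varepsilon} + q^{\frac12+\theta-\frac{\kappa}{k}+\varepsilon}\big),
\]

with notations as in Proposition 2.5.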


Proof. The proof follows the proofs of Propositions 2.4 and 2.5, with Lemma 3.1 as the orthogonality
relation in place of Lemma 2.3.

For the first mollified moment, we start by inserting the approximate functional equation as in
the proof of Proposition 2.4. By Lemma 3.1 the character average isolates the main term, which comes from $mn = 1$
and is identical as before, and summands with $s = mn \equiv \zeta \bmod p^{\kappa}$, $s > 1$, which we split into
two terms, corresponding to the ranges $1 < s < p^{\kappa}$ and $p^{\kappa} < s < q^{\frac12+\theta+\varepsilon}$. By Lemma 2.6, for
sufficiently large $k$, all summands in the first sum satisfy $s > (p^{\kappa})^{\frac12-\delta}$, and in total the first error
term is $O\big((p^{\kappa})^{-\frac14+\frac{\delta}{2}}\big)$. Writing $s = \zeta + p^{\kappa} r$, the second error term is similarly bounded by

\[
q^{\varepsilon} \sum_{1 \le r \le q^{\frac12+\theta+\varepsilon}/p^{\kappa}} (p^{\kappa} r)^{-\frac12} \ll q^{\frac14+\frac{\theta}{2}+\varepsilon} p^{-\kappa} = q^{\frac14+\frac{\theta}{2}-\frac{\kappa}{k}+\varepsilon},
\]

completing the proof of the first part of our proposition.

As for the second mollified moment, we start by expanding using the same functional equation (3.3). The main term arises from the diagonal terms, when $n_1 m_1 = n_2 m_2$, and is the same
as above, while in the off-diagonal terms we are left by Lemma 3.1 with the same sums as before, now
subject to $a^{p-1} \equiv b^{p-1} \bmod p^{\kappa}$, and which we further split as in (3.6). In the first sum, over
$a \equiv \pm b \bmod p^{\kappa}$, we must have $2A > p^{\kappa}$ or $2B > p^{\kappa}$, and the total contribution from these terms is
bounded by

\[
\frac{q^{\varepsilon}}{(AB)^{1/2}} \cdot \frac{AB}{p^{\kappa}} \ll q^{\frac12+\theta-\frac{\kappa}{k}+\varepsilon}.
\]

In the second sum, analogously as before, we may assume that $p^{-\kappa/2} < A/B < p^{\kappa/2}$, since otherwise the
proof is complete as in (3.9). We now set $Q = (p^{\kappa})^{\frac12-2\delta}$ and consider three cases. If $2A \le (p^{\kappa})^{\frac12-\delta}$
and $2B \le (p^{\kappa})^{\frac12-\delta}$, then the sum is empty for sufficiently large $k$ by Lemma 2.6. If $2A > (p^{\kappa})^{\frac12-\delta}$
and $2B \le (p^{\kappa})^{\frac12-\delta}$, then by splitting the $a$-sum into intervals of length $Q$ the sum is bounded as in
(3.11) by

\[
\frac{q^{\varepsilon}}{(AB)^{1/2}} \cdot \frac{A}{Q} \ll q^{-\frac{\kappa}{4k}+2\delta+\varepsilon},
\]

and, finally, if both $2A > (p^{\kappa})^{\frac12-\delta}$ and $2B > (p^{\kappa})^{\frac12-\delta}$, then the sum is bounded as in (3.13) by

\[
\frac{q^{\varepsilon}}{(AB)^{1/2}} \cdot \frac{AB}{Q^2} \ll q^{\frac12+\theta-\frac{\kappa}{k}+4\delta+\varepsilon},
\]

which completes the proof as we may take $\delta$ as small as we wish. □
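
The exponent bookkeeping in the second part of this proof can be checked mechanically. The Python sketch below uses exact rational arithmetic; the four exponent formulas are this sketch's reading of the bounds in the proof (with `rho` standing for $\kappa/k$, the factor $q^{\varepsilon}$ ignored, and $AB \le q^{1+2\theta}$ assumed), and the names `thin_orbit_exponents` and `all_negative` are hypothetical.

```python
from fractions import Fraction as F

def thin_orbit_exponents(rho, theta, delta):
    """Exponents of q in the four off-diagonal bounds, with rho = kappa/k.
    ASSUMED formulas, read off from the proof up to q^epsilon."""
    return {
        # sum over a = +-b mod p^kappa: (AB)^(1/2) / p^kappa
        "a = +-b": F(1, 2) + theta - rho,
        # A/B outside (p^(-kappa/2), p^(kappa/2)): dominant term p^(-kappa/4)
        "unbalanced": -rho / 4,
        # one long range: (A/B)^(1/2) / Q with Q = (p^kappa)^(1/2 - 2 delta)
        "one long range": rho * (-F(1, 4) + 2 * delta),
        # two long ranges: (AB)^(1/2) / Q^2
        "two long ranges": F(1, 2) + theta - rho * (1 - 4 * delta),
    }

def all_negative(rho, theta, delta):
    """True if every bound is a negative power of q."""
    return all(e < 0 for e in thin_orbit_exponents(rho, theta, delta).values())

inside = all_negative(F(3, 4), F(1, 5), F(1, 100))    # theta < rho - 1/2
boundary = all_negative(F(3, 4), F(1, 4), F(1, 100))  # theta = rho - 1/2
print(inside, boundary)
```

With $\kappa/k = 3/4$, the choice $\theta = 1/5$ lies inside the admissible range and all four exponents are negative, while at the endpoint $\theta = 1/4$ the first exponent is $0$, which is consistent with the restriction $\theta < \frac{\kappa}{k} - \frac12$.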

Theorem 1.3 now follows from Proposition 3.2 in the same way in which Theorem 1.1 is deduced
from Propositions 2.4 and 2.5, by taking $\theta = \frac{\kappa}{k} - \frac12 - \delta$.